Lower bounds on frequency estimation of data streams

نویسنده

  • Sumit Ganguly
چکیده

We consider a basic problem in the general data streaming model, namely, to estimate a vector f ∈ Z that is arbitrarily updated (i.e., incremented or decremented) coordinatewise. The estimate f̂ ∈ Z must satisfy ‖f̂ − f‖∞ ≤ ǫ‖f‖1, that is, ∀i (|f̂i − fi| ≤ ǫ‖f‖1). It is known to have Õ(ǫ) randomized space upper bound [6], Ω(ǫ log(ǫn)) space lower bound [4] and deterministic space upper bound of Ω̃(ǫ) bits. We show that any deterministic algorithm for this problem requires space Ω(ǫ−2(log‖f‖1)(logn)(log (ǫ)) bits.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Data Streams with Bounded Deletions

Two prevalent models in the data stream literature are the insertion-only and turnstile models. Unfortunately, many important streaming problems require a Θ(log(n)) multiplicative factor more space for turnstile streams than for insertion-only streams. This complexity gap often arises because the underlying frequency vector f is very close to 0, after accounting for all insertions and deletions...

متن کامل

Better Bounds for Frequency Moments in Random-Order Streams

Estimating frequency moments of data streams is a very well studied problem [1–3,9,12] and tight bounds are known on the amount of space that is necessary and sufficient when the stream is adversarially ordered. Recently, motivated by various practical considerations and applications in learning and statistics, there has been growing interest into studying streams that are randomly ordered [3,4...

متن کامل

Robust Identification of Smart Foam Using Set Mem-bership Estimation in A Model Error Modeling Frame-work

The aim of this paper is robust identification of smart foam, as an electroacoustic transducer, considering unmodeled dynamics due to nonlinearities in behaviour at low frequencies and measurement noise at high frequencies as existent uncertainties. Set membership estimation combined with model error modelling technique is used where the approach is based on worst case scenario with unknown but...

متن کامل

Lower Bounds on Frequency Estimation of Data Streams (Extended Abstract)

We consider a basic problem in the general data streaming model, namely, to estimate a vector f ∈ Z that is arbitrarily updated (i.e., incremented or decremented) coordinatewise. The estimate f̂ ∈ Z must satisfy ‖f̂ − f‖∞ ≤ ‖f‖1, that is, ∀i (|f̂i − fi| ≤ ‖f‖1). It is known to have Õ( −1) randomized space upper bound [4], Ω( −1 log( n)) space lower bound [2] and deterministic space upper bound of ...

متن کامل

Distinct-Values Estimation over Data Streams

In this chapter, we consider the problem of estimating the number of distinct values in a data stream with repeated values. Distinctvalues estimation was one of the first data stream problems studied: In the mid-1980’s, Flajolet and Martin gave an effective algorithm that uses only logarithmic space. Recent work has built upon their technique, improving the accuracy guarantees on the estimation...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008